An Inheritance-Based Description of Bulgarian Noun Inflection

نویسندگان

  • Katina Bontcheva
  • James Kilbury
چکیده

This paper discusses an analysis of Bulgarian noun inflection that uses non-monotonic inheritance and abstract morphophonemic representation. The analysis is encoded in the lexical knowledge representation language DATR. In this discussion, we approach Bulgarian noun inflection in terms close to those of Network Morphology (cf. Corbett & Fraser (1993)) and related theories of inflection. Thus, an essential feature of the analysis is that inflectional types are represented as objects within a nonmonotonic inheritance hierarchy that gives an explicit account of regularities, subregularities and exceptions through highly constrained lexical entries, default inflectional class(es) and non-productive class(es). Our goal has been to make a precise and explicit description allowing economical lexical entries. We decided to use abstract morphophonemic representations to simplify the description of alternations and the morphotactics. For example, the phoneme i has at least three sources: i1, i2 and ы. The i that is not involved in morphophonemic alternations is transliterated as i, while the i that triggers 2 palatalization is marked in the lexicon as 2 but surfaces as i in the generated forms. This may look like diachronic description but is not; rather our description is synchronic and applicationoriented since the use of deep morphophonemic representations is 1 For details on the transcription cf. the table in Appendix 1. always motivated by synchronic alternations. Nouns in Bulgarian have two inflectional categories – number and definiteness, and one structural category – gender. The most important elements through which the paradigmatic oppositions are described are the base, sg stem suffixes (suff sg), plural stem suffixes (suff pl), inflections (flex pl) and articles. The base of a noun is the part that is left after the inflectional elements such as singular and plural stem suffixes, endings and articles have been removed. Thus, the base in Bulgarian consists of optional prefixes, root(s), and optional suffixes. For example, the word uchilishte ‘school’ splits into the base uchilisht and the sg stem suffix e. The suffix isht belongs to the base as it appears in all forms of the lexeme. The same suffix however, may appear as an inflectional element, e.g. a plural stem suffix, cf. kup ‘heap’ – kupisht-a ‘heaps’ where a is the plural ending and isht is a plural stem suffix. The sg stem suffixes define the gender of the noun. Every noun in Bulgarian belongs to one of the three genders masculine 2 For a detailed description of nominal number systems cf. Corbett (2000).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Memory-based Learning Models of Inflectional Morphology: a Methodological Case Study

The paper investigates the memory-based learning (MBL) paradigm as a model of productive linguistic behavior in the domain of Dutch noun plural inflection. We first sketch the origin and background of the MBL approach, to then provide a short overview of Dutch noun plural inflection along with a detailed description of the use of MBL models for inflectional morphology. Results of a large number...

متن کامل

A FST Description of Noun and Verb Morphology of Azarbaijani Turkish

We give a FST description of nominal and finite verb morphology of Azarbaijani Turkish. We use a hybrid approach where nominal inflection is expressed as a slot-based paradigm and major parts of verb inflection are expressed as optional paths on the FST. We collapse adjective and noun categories in a single nominal category as they behave similarly as far as their paradigms are concerned. Thus,...

متن کامل

On Detecting Noun-Adjective Agreement Errors in Bulgarian Language Using GATE

In this article, we describe an approach for automatic detection of noun-adjective agreement errors in Bulgarian texts by explaining the necessary steps required to develop a simple Java-based language processing application. For this purpose, we use the GATE language processing framework [9], which is capable of analyzing texts in Bulgarian language and can be embedded in software applications...

متن کامل

Bulgarian Inflectional Morphology in Universal Networking Language

The paper presents a web-based application of semantic networks to model Bulgarian inflectional morphology. It demonstrates the general ideas, principles, and problems of inflectional grammar knowledge representation used for encoding Bulgarian inflectional morphology in Universal Networking Language (UNL). The analysis of UNL formalism is outlined in terms of its expressive power to present in...

متن کامل

Extraction of Definitions for Bulgarian

We participated at the Monolingual Bulgarian QA task at CLEF-2006 with a definition extraction system based on linguistic templates and keywords. Our system uses a partial syntactic parser for Bulgarian to detect noun phrases as candidates for definitions. Our system answered correctly to 28% of the definition questions.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003